Evolutionary Computation in Data Mining

نویسندگان

  • Ashish Ghosh
  • Lakhmi C. Jain
  • Jose Ramon Cano
  • Francisco Herrera
  • Manuel Lozano
  • Matthew G. Smith
  • Riyaz Sikora
  • J. STAŃCZAK
  • Xiju Fu
  • Lipo Wang
  • Qi Yu
  • Kay Chen Tan
  • Tong Heng Lee
  • Minh Ha Nguyen
  • Hussein Abbas
چکیده

" Evolutionary Computation in Data Mining " edited by Ashish Ghosh and Lakhmi C. Jain, published in the series on Studies in Fuzziness and Soft Computing by Springer contains twelve skillful papers, which show the advances and state of the art in the domain of evolutionary methods' application in data mining. Data mining is a relatively broad domain of automatic knowledge discovery from databases and is now one of the most developed fields in the area of artificial intelligence. A rapid growth of data collected in various realms of human activity and their potential usefulness in these realms requires appropriate and efficient tools to extract and use of the potentially gathered knowledge. Evolutionary algorithms are very powerful tools and can be utilized in many ways to make use of knowledge hidden in terabites of data collected in databases all over the world. In the Chapter 1 by Ashish Ghosh some essential aspects of application of evolutionary computation methods for rule generation are discussed. focuses on data preprocessing methods meant to minimize the size of data to be processed. Data reduction is obtained using a CHC evolutionary algorithm with stratification strategy to select representative instances of the processed data set. The presented evolutionary method outperforms the non-evolutionary ones and gives better instance reduction rates, higher accuracy and lower resources consumption. The next chapter by Matthew G. Smith and Larry Bull presents also a preprocessing method, where genetic programming is used to feature construction (inferring new features enables to find hidden relationships between them) and a genetic algorithm for feature selection from the set produced by the first stage. The preprocessing method with the C4.5 classification algorithm works significantly better than the single C4.5 method. Also the fourth chapter by Riyaz Sikora presents a multi-agent based method of feature selection and inductive learning. Genetic algorithms are treated as learning agents to discover solutions of small subproblems into which the whole data set is decomposed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Technique for Improving Web Mining using Enhanced Genetic Algorithm

World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines used conventional methods to retrieve information on the Web; however, the search results of these engines are still able to be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms which search according to the user interests...

متن کامل

Estimation of LPC coefficients using Evolutionary Algorithms

The vast use of Linear Prediction Coefficients (LPC) in speech processing systems has intensified the importance of their accurate computation. This paper is concerned with computing LPC coefficients using evolutionary algorithms: Genetic Algorithm (GA), Particle Swarm Optimization (PSO), Dif-ferential Evolution (DE) and Particle Swarm Optimization with Differentially perturbed Velocity (PSO-DV...

متن کامل

An Evolutionary Data Clustering Algorithm

Data mining is the process of deriving knowledge from data. The data clustering is a classical activity in data mining. Clustering is the process of grouping objects together in such a way that the objects belonging to the same group are similar and those belonging to different groups are dissimilar. In this paper we propose a method to carry out data clustering using Evolutionary Computation. ...

متن کامل

Functional Brain Imaging with Multi-objective Multi-modal Evolutionary Optimization

Functional brain imaging is a source of spatio-temporal data mining problems. A new framework hybridizing multi-objective and multimodal optimization is proposed to formalize these data mining problems, and addressed through Evolutionary Computation (EC). The merits of EC for spatio-temporal data mining are demonstrated as the approach facilitates the modelling of the experts’ requirements, and...

متن کامل

بررسی نقش عوامل مؤثر بر فراوانی حوادث در لوله‌های اصلی آب رسانی ‌با استفاده از الگوی رگرسیونی ترکیبی

A water distribution network is one of the important parts of infrastructure systems. The efficient management and proactive planning of capital investment of these assets are fundamental for efficient and effective service delivered by water companies. The direct economic costs (i.e. rehabilitation investment, repair costs, water loss, etc.) as well as indirect costs (i.e. service and traffic ...

متن کامل

Evolutionary Computation and Data Mining

Future products and processes will be impacted by biology and information technology. The developments in molecular and DNA computing may revolutionize the information technology. In this paper, the product and process design challenges are discussed. Evolutionary computation and data mining are two major tools that will cope with these challenges. The basic background of the two tools as well ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008